FastStep: Scalable Boolean Matrix Decomposition
نویسندگان
چکیده
Matrix Decomposition methods are applied to a wide range of tasks, such as data denoising, dimensionality reduction, co-clustering and community detection. However, in the presence of boolean inputs, common methods either do not scale or do not provide a boolean reconstruction, which results in high reconstruction error and low interpretability of the decomposition. We propose a novel step decomposition of boolean matrices in non-negative factors with boolean reconstruction. By formulating the problem using threshold operators and through suitable relaxation of this problem, we provide a scalable algorithm that can be applied to boolean matrices with millions of non-zero entries. We show that our method achieves significantly lower reconstruction error when compared to standard state of the art algorithms. We also show that the decomposition keeps its interpretability by analyzing communities in a flights dataset (where the matrix is interpreted as a graph in which nodes are airports) and in a movie-ratings dataset with 10 million non-zeros.
منابع مشابه
Characteristic matrix of covering and its application to Boolean matrix decomposition
Covering-based rough sets provide an efficient theory to deal with covering data which widely exist in practical applications. Boolean matrix decomposition has been widely applied to data mining and machine learning. In this paper, three types of existing covering approximation operators are represented by boolean matrices, and then they are used to decompose into boolean matrices. First, we de...
متن کاملDiscovery of factors in binary data triangular decomposition of matrices
We present new methods of decomposition of an n ×m binary matrix I into a product A ∗ B of an n × k binary matrix A and a k×m binary matrix B. These decompositions are alternative to the usual one which is sought in Boolean factor analysis (BFA), where ∗ is a Boolean product of matrices. In the new decompositions, ∗ are the left and the right triangular products of Boolean matrices. In BFA, I i...
متن کاملOn Decomposing Systems of Boolean Functions via Ternary Matrix Cover Approach
The problem of two-block disjoint decomposition of completely specified Boolean functions is considered. Recently a good method in functional decomposition category was proposed. This method is based on using the ternary matrix cover approach. Due to investigation and analysis of this method and to search for an appropriate partition, a computer program was developed. After running the program ...
متن کاملAn Efficient FPGA Implementation of Scalable Matrix Inversion Core using QR Decomposition
We present a novel scalable architecture for matrix inversion that uses the modified Gram-Schmidt algorithm based on QR decomposition. Our core achieves a throughput of 0.18M updates per second for a 4 x 4 matrix using 19 bits of precision on a Xilinx Virtex4 SX FPGA. We also present two different designs which use longer data lines, 26 and 32 bits, and compare our results with another matrix i...
متن کاملTriadic Factor Analysis
This article is an extension of work which suggests using formal concepts as optimal factors of Factor Analysis. They discussed a method for decomposing a p × q binary matrix W into the Boolean matrix product P ◦ Q of a p × n binary matrix P and a n × q binary matrix Q with n as small as possible. We have generalised this factorization problem to the triadic case, looking for a decomposition of...
متن کامل